首页> 外文OA文献 >Efficient visual search of videos cast as text retrieval.
【2h】

Efficient visual search of videos cast as text retrieval.

机译:对视频进行高效的视觉搜索,将其转换为文本检索。

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We describe an approach to object retrieval which searches for and localizes all the occurrences of an object in a video, given a query image of the object. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject those that are unstable. Efficient retrieval is achieved by employing methods from statistical text retrieval, including inverted file systems, and text and document frequency weightings. This requires a visual analogy of a word which is provided here by vector quantizing the region descriptors. The final ranking also depends on the spatial layout of the regions. The result is that retrieval is immediate, returning a ranked list of shots in the manner of Google. We report results for object retrieval on the full length feature films 'Groundhog Day', 'Casablanca' and 'Run Lola Run', including searches from within the movie and specified by external images downloaded from the Internet. We investigate retrieval performance with respect to different quantizations of region descriptors and compare the performance of several ranking measures. Performance is also compared to a baseline method implementing standard frame to frame matching.
机译:我们描述了一种对象检索方法,该方法在给定对象的查询图像的情况下,搜索并定位视频中所有出现的对象。该对象由一组视点不变区域描述符表示,因此尽管视点,照明和部分遮挡发生了变化,也可以成功进行识别。镜头中视频的时间连续性用于跟踪区域,以拒绝不稳定的区域。通过采用统计文本检索中的方法(包括反向文件系统)以及文本和文档频率加权,可以实现有效的检索。这需要视觉上的单词类比,此处通过矢量量化区域描述符来提供此类单词。最终排名还取决于区域的空间布局。结果是检索是即时的,并以Google的方式返回排名的镜头列表。我们报告了在全长电影《 Groundhog Day》,《 Casablanca》和《 Run Lola Run》中对象检索的结果,包括从电影中进行搜索并由从Internet下载的外部图像指定。我们调查有关区域描述符的不同量化的检索性能,并比较几种排名度量的性能。还将性能与实现标准帧到帧匹配的基线方法进行比较。

著录项

  • 作者

    Sivic, J; Zisserman, A;

  • 作者单位
  • 年度 2009
  • 总页数
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号